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1.  Introduction 

The  idea  that  the  underlying  mechanism  of  inheritance  is  the  transmission 
according  to  elementary  laws  of  probability  of  particulate  units  was,  of  course, 
due  to  the  genius  of  Mendel.  If  one  of  the  parents  of  an  individual  only  has 
an  A  gene  and  an  offspring  has  an  A  gene,  then  the  A  gene  of  the  offspring  is 
a  copy  of  the  A  gene  of  the  particular  parent,  and  the  two  A  genes  are  identical 
by  descent.  Mendel  knew  this,  even  though  he  did  not  use  precisely  these  words 
to  express  the  fact.  Early  in  the  rediscovery  of  Mendel’s  work,  mathematicians 
or  statisticians  or  biometricians  (whatever  one  wishes  to  call  the  individuals) 
were  impressed  by  the  mathematical  interest  of  the  Mendelian  system.  It  was 
realized  early,  of  course,  that  inbreeding,  that  is,  the  mating  of  related  indi¬ 
viduals  was  an  important  tool  for  the  understanding  of  genetic  phenomena. 
It,  therefore,  became  necessary  to  develop  some  of  the  theory  of  inbreeding. 
Pioneers  in  this  work  were  Robbins  and  Jennings.  But  the  great  advance  was 
made  by  Sewall  Wright  [7],  who,  it  appears  to  me,  singlehandedly  developed 
the  method  of  path  coefficients  to  enable  the  answering  of  many  important 
questions  of  inbreeding.  The  magnitude  of  Wright’s  contribution  stuns  my 
imagination.  It  is  true  that  much  of  his  work  was  algebraic  computation  and 
this  any  one  could  do.  But  the  erection  of  a  framework  for  the  computations 
was  a  great  intellectual  feat.  It  would  not,  however,  be  stretching  the  history 
to  say  that  to  most  of  the  world  the  method  of  path  coefficients  was  a  mystique, 
obviously  very  powerful  in  the  hands  of  its  inventor,  but  quite  obscure  to  many 
others.  Perhaps  the  cornerstone  of  Wright’s  work  was  the  coefficient  of  correla¬ 
tion  of  uniting  gametes  or  the  coefficient  of  inbreeding  F  and  Wright  developed 
knowledge  of  the  progress  of  F  with  various  systems  of  inbreeding. 

It  appears  to  be  the  great  contribution  of  Mascot  [5]  to  put  the  Wright 
ideas  into  a  form  which  was  more  readily  understood  and  communicable,  by 
introducing  the  ideas  of  genes  being  alike  by  descent  and  by  considering  in- 
breeding  purely  as  a  matter  of  probability  of  genes  being  identical  by  descent. 
To  fix  the  ideas,  let  a  diploid  individual  X  have  two  genes  a,  6  at  a  locus.  Then, 
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the  coefficient  of  inbreeding  F  is  the  probability  that  genes  a,  b  have  arisen  by 
the  copying  process  of  reproduction  from  a  single  gene  in  the  ancestry.  At  the 
same  time  Mal^cot  defined  the  “coefficient  de  parente”  (which  I  translated  as 
“coefficient  of  parentage,,)  of  two  individuals  X  and  Y  as  the  probability  that 
a  random  gene  of  X  and  a  random  gene  of  Y  at  the  same  locus  are  identical  by 
descent.  The  inherent  simplicity  of  these  ideas  is  conveyed  in  the  simple  for¬ 
mulae  for  diploids, 

Fxxy  =  rXY, 


(1.1) 


TA(BXC)  =  2  (rAB  +  Tac)- 


It  is  then  a  relatively  simple  matter  to  develop  the  progress  of  F  over  genera¬ 
tions  of  inbreeding.  The  great  bulk  of  the  elementary  results  is  given  in  my 
book  [4].  Of  course,  the  ensuing  result  is  a  function  of  the  status  of  the  base 
population  from  which  the  probabilities  are  calculated,  and  this  is  true  what¬ 
ever  mode  of  calculating  probabilities  is  followed.  The  criticisms  by  Fisher  [1] 
of  Wright’s  F  are,  therefore,  clearly  unfounded. 

Mal6cot  [5]  applied  these  ideas  also  to  the  progress  of  a  finite  population 
and  to  quantitative  inheritance.  Later  I  [3]  showed  that  the  covariance  of  two 
relatives  X  and  Y,  neither  of  which  is  inbred,  in  a  random  mating  population 
without  linkage,  with  respect  to  a  metrical  trait,  which  contains  a  random 
environmental  contribution,  is  equal  to 

(1.2)  2txyoa  +  Uxygd  +  X)  (2rxr)r(wxy)  V  a'lt, 

=  3 

where  rXy  is  given  above  and  Uxy  is  the  probability  that  the  genes  of  X  are 
identical  by  descent  to  the  genes  of  Y  at  a  locus,  and  o\,  and  so  forth,  are 
components  of  genotypic  variance.  More  recently  Harris  [2]  has  generalized 
this  result  to  the  case  when  the  individuals  X  and  Y  are  inbred,  that  is,  have 
nonzero  coefficient  of  inbreeding,  and  his  ideas  were  a  stimulus  to  some  of  the 
material  I  present  later. 

It  seems  to  be  inherent  in  the  concept  of  identity  by  descent  that  the  processes 
of  genetics  for  which  it  has  utility  are  processes  which  do  not  depend  on  the 
actual  identity  of  the  genes.  Thus,  each  gene  is  to  be  subjected  to  the  same 
probability  processes  as  every  other  gene.  In  contrast  to  this,  if  the  different 
genes  have  different  selective  values  (however  this  be  defined),  their  fates  in 
the  evolutionary  process  will  depend  on  their  actual  names.  So  the  idea  of 
identity  by  descent  has  so  far  been  applied  only  to  the  processes  of  a  genetic 
population  with  “neutral”  genes.  I  would  not  wish  to  imply  by  this,  that  I 
believe  there  are  any  such  “neutral”  genes.  It  was  thought  at  one  time  that 
there  were  genes  that  were  “neutral”  as  regards  “selective  fitness,”  but  every 
such  case  has  been  shown  later  to  be  fallacious.  Looking  back,  one  may  well 
wonder  how  the  idea  was  ever  judged  reasonable,  but  perhaps  I  am  using 
hindsight  which  is  always  better  than  foresight. 
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One  should  not,  on  account  of  the  above,  consider  that  a  theory  of  the  progress 
of  a  population  with  “neutral”  genes  is  irrelevant,  because  such  a  theory  gives 
a  base  of  reference  by  which  to  evaluate  the  lack  of  neutrality  of  genes. 

The  applications  of  the  concept  of  identity  of  a  pair  of  genes  to  regular  sys¬ 
tems  of  inbreeding  are  reviewed  extensively  in  my  book  [4].  The  aim  of  the 
present  paper  is  to  present  some  basic  ideas  of  the  application  to  the  progress 
of  finite  populations,  and  to  describe  some  extensions  of  the  basic  idea  to  the 
status  of  more  than  two  genes. 

2.  An  elementary  use  of  the  concept 

Suppose  we  start  with  a  diploid  population  of  size  N,  having  N i  A  genes  and 
N2  a  genes,  with  Ni  -f-  jV2  =  2 N.  Suppose  that  there  is  no  selection,  mutation, 
immigration  or  emigration.  Then,  the  fate  of  genes  is  determined  solely  by  the 
elementary  probability  processes  of  Mendelism.  Suppose  also  that  each  genera¬ 
tion  arose  by  mating  of  individuals  of  the  preceding  generation,  these  matings 
possibly  being  based  on  consanguinity,  but  random  within  any  consanguinity 
restrictions.  Suppose  also  that  we  can  calculate  on  the  basis  of  the  mating  sys¬ 
tem,  the  probability  that  two  genes  of  an  individual  are  identical  by  descent 
and  find  this  to  be  F,  say.  Then  the  probability  status  of  this  individual  is 
given  by  the  array 

(2.1)  F[M±AA+§aa) 

+  (1  -F)  {2Af(2iv  -  1)  Nl<~Nl  ~  1)AA  +  Ni(-N*  -  C™  +  2NlN2Aa\- 

That  this  is  so  follows  by  elementary  arguments  because  the  two  events :  (i)  the 
genes  of  an  individual  being  alike  by  descent  and  (ii)  the  genes  of  an  individual 
being  unalike  by  descent,  are  mutually  exclusive,  with  probabilities  F  and  1  —  F, 
respectively.  In  the  absence  of  any  selection,  the  conditional  probability  that  a 
single  gene  from  which  a  pair  are  both  descended  is  an  A  gene  is  equal  to 
Ni/2N,  and  that  it  is  an  a  gene  is  N2/2N.  If  the  genes  are  copies  of  two  different 
genes  in  the  base  population,  the  probability  that  both  are  A  genes  is 
Ni(Ni  —  l)/2N(2N  —  1),  and  so  on. 

This  argument  gives  then  the  probability  status  of  a  random  individual  with 
coefficient  of  inbreeding  F.  It  is  to  be  noted  that  it  cannot  be  used  to  give  the 
probability  status  of  two  individuals.  In  fact  the  argument  gives  only  the  status 
of  two  genes  united  in  the  particular  individual. 

If,  of  course,  N\  and  N2  are  both  indefinitely  large  with  N\/2N  =  p  and 
N2/2N  =  q,  the  probability  status  of  an  individual  is 

(2.2)  FpAA  +  qaa  +  (1  —  F)p2AA  -{-  2pqAa  +  q2aa. 

The  generalization  to  k  types  of  genes  is  quite  obvious. 
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3.  A  finite  dioecious  population 

Suppose  we  have  a  finite  population  of  N  individuals,  and  that  each  member 
of  a  succeeding  generation  arises  by  random  mating.  Let  the  individuals  in 
generation  t  be 

(3.1)  A\\A\2,  A21A22,  •  •  •  >  A^iA1^. 

A  possible  mating 

(3.2)  AhAh  X  Ai'iAl'2 
gives  the  offspring  array 

(3.3)  -  \_AiiAi'i  -f-  A\\A\>2  +  AvAyi  +  AaAn], 

so  that  the  inbreeding  coefficient  Ft+i  of  any  individual  in  generation  (t  +  1)  is 

(3.4)  N(N-  1)  It'  PfiA‘lj  =  free’ 

t 

which  is  the  average  coefficient  of  parentage  of  two  individuals  in  generation  t. 
So 

(3.5)  Ft+i  =  Rt. 

An  obvious  argument  gives 

(3.0)  Rl+l  =  ^  i  (1  +  F 1)  +  N(N  ~  1)*.], 

and  the  well  known  conclusion 


follows.  This  argument  seems  superior  to  that  presented  in  my  book  [4]. 

4.  A  finite  haploid  population  with  random  viability 

A  common  way  of  handling  the  two  conflicting  aspects  of  finiteness  of  popu¬ 
lation  and  variability  of  number  of  offspring  per  individual  is  to  follow  a  con¬ 
ditioning  argument.  One  may  suppose,  for  instance,  that  the  individuals  of  the 
finite  population  produce  offspring  independently  according  to  a  Poisson  distri¬ 
bution,  and  then  consider  the  probabilities  of  numbers  of  each  type  conditional 
on  the  total  offspring  population  being  equal  to  the  population  size  N.  This 
process  is  mathematically  workable  and  is  a  way  of  introducing  reproductive 
competition.  I  find  some  obscurity  in  understanding  the  effect  of  this  condition¬ 
ing  on  the  distribution  of  actual  number  of  progeny  in  the  population,  though 
presumably  this  can  be  worked  out. 

An  alternative  easy  approach  with  regard  to  homozygosity  can  be  worked 
out  by  means  of  the  concept  of  identity  of  genes  by  descent  in  the  following  way. 
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Consider  the  population  at  generation  t  with  individuals 

(4.1)  A\,  A2,  •  •  •  ,  A*n. 

Take  a  distribution  of  a  nonnegative  random  variable  X.  Let  N  random  values 
be  X\,  X2,  •  •  •  ,  xn,  and  let 

(4.2)  P.  - 

Let  the  probability  that  an  individual  in  generation  t  +  1  arises  from  individual 
A i  be  pi,  from  individual  A2  be  p2  and  so  on.  Label  the  individuals  in  generation 
t  +  1  as 

(4.3)  AT,  AT,  ■■■  ,  A^\ 

Then,  with  independent  drawing  for  each  individual  of  generation  ( t  -f  1),  we 
have 

(4.4)  P{AT  =  AT}  =  E  pf  +  E  PiVi'P{A\  =  A}}, 

i  i,»' 

where  the  P  are  probabilities  of  genes  being  identical  by  descent.  The  left  side 
is  the  same  for  all  pairs  of  individuals  after  the  first  generation,  so  we  can  write 

(4.5)  Fl+i  =  E  p2i  +  [1  —  Z  Pi]  Ft 

i 

or  with  P  =  1  —  F,  so  that  P  is  the  panmictic  index  of  Wright, 

(4.6)  Pt+1  =  (1  ~  E  P?)Pt. 

Hence,  if  M2  is  the  second  moment  about  the  origin  of  the  “viabilities,”  the  pi} 
we  have 

(4.7)  Pt+1  =  (1  -  N^)Pt, 

a  very  simple  recurrence  equation.  In  the  special  case  when  all  p,  equal  1/N, 
the  equation  is 

(4.8)  Pl+,  =  (l 
which  is  very  well  known. 

5.  A  simplified  two  niche  problem 

I  am  indebted  to  E.  Poliak  for  the  following  example  and  some  ideas  about  it. 
The  progress  of  a  population  divided  into  two  parts,  or  niches,  between  which 
there  is  random  migration  is  of  some  general  genetic  interest.  Suppose  the 
generations  do  not  overlap,  and  that  in  niche  1  in  generation  t,  the  genes  are 

(5.1)  An,  A12,  •  •  •  , 
and  in  niche  2  the  genes  are 

(5.2)  A 21,  A-22,  •  •  ■  ,  A 2 A's • 
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Suppose  a  gene  for  niche  1  in  generation  (t  +  1)  arises  with  probability  (1  —  p) 
from  niche  1,  and  with  probability  p  from  niche  2,  and  similarly  for  a  gene  in 
niche  2.  Define, 

Ff  —  probability  that  two  genes  in  niche  1  in  generation  t  are  identical  by 
descent, 

Ff  =  same  for  two  genes  in  niche  2, 

R 1  =  probability  that  a  gene  in  niche  1  and  a  gene  in  niche  2  in  generation  i 
are  identical  by  descent. 

Then,  with  no  selection  of  any  sort,  the  probability  array  of  a  gene  in  niche  1 
in  generation  t  +  1  is 

(5-3)  (1  —  p)  An  +  A12  +  •  •  •  +  A‘imJ 

+  p  A‘2l  +  L  ^  +.  •  •  •  +  L  itu]. 


Hence,  P{A\tl  =  Ait1},  the  probability  that  two  genes,  labeled  for  conven¬ 
ience  as  u  and  v,  in  niche  1  in  generation  (t  +  1)  are  identical  by  descent  is 
equal  to 


(5.4) 


(1  p)i  JV.  +  (1  -  vY-  -V|lAj.r  ‘V'.  +  ^A’. 


m 


Ni 


Nl 


+  p 2 


N2(N2  -  1) 

Nl 


F‘2  +  2(1  -  p)pRK 


A  similar  expression  with  the  subscripts  1  and  2  interchanged  holds  for  a  pair 
of  genes  in  niche  2.  Also, 

(5.5)  JR*+i  =  =  Ait1} 


(1  ~  P)P 
N\ 


Ni- f- 


(1  -  p)p 


N\ 


NiiNi  -  l)F{  + 


(1  -  v)p 

Nl 


N2 


+ 


(1  ~  P)P 
Nl 


N2(N2  -  1)F2  + 


p2  +  (i  —  pY 
N1N2 


N1N2RK 


If  now,  as  is  advisable  in  all  calculations  of  this  type,  we  let  P  =  1  —  F, 
S  =  1  —  R,  we  have 


PS+ 1  =  - - - -  PS  +  Pi  +  2p(l  -  p)S‘, 


(5.6)  Pi*'-  pHN'N~1]  PS  +  ^ ^  Pi  +  2p(l  -  p)S\ 


s,+1  =  pH — pMj. — L)  p>  +  HI — PHHi — 1)  p;,  +  [y  +  (i-  p)2]s>. 


So  we  have  a  simple  recurrence  relation  for  the  three  quantities  Pi,  P2  and  S, 
which  can  be  solved  for  any  choice  of  N i,  N2  and  p. 

We  consider  here  the  solution  of  a  special  case  only.  We  shall  calculate  the 
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values  for  P,  R,  S  with  reference  to  a  base  population  for  which  we  take  Pi,  Pi 
and  S  to  be  unity,  of  course,  and  for  which  Ni  equals  N2.  Obviously,  under  these 
circumstances  P\  =  P\  =  Pl,  say,  so  the  recurrence  equations  reduce  to  two 
only,  namely, 

p- H  =  [(1  -  vY  +  p2]  (l  -jf)p‘  +  2p(l  -  P)S‘, 

(5.7) 

5'+'  =  2p(l  -  p)  (l  -  ^)  P‘  +  [p2  +  (1  -  p)2]S‘. 

The  characteristic  equation  is 

[(1  -  p)2  +  p2]  ^1  -  -  X  2p(l  -  p) 

(5.8)  =  0 
2p(l  -  p)  (l  -  jj)  (1  -  p)2  +  p2  -  X 

or 

(5.9) 

X*  -  X  (2  -  i)  [(1  -  p)2  +  p2]  +  (l  -  i)  {[(1  -  p)2  +  p2]2  -  4p2(l  -  pYi 

=  0 

=  X2  -  X  (2  -  i)  [1  -  2p(l  -  p)]  +  (1  -  1)  [1  -  4p(l  -  p)]  =  0. 

The  roots  Xi  and  X2  are,  therefore, 

(5.10) 

(1-2]v)  [1-2P(1-P)] 

±  ((l  ~  J^)  [1  -  4p(l  -  p)  +  4p2(l  -  p)2]  -  (l  -  jj)  [1  -  4p(l  -  p)]| 

=  0-2 b)  [1-2p(1-p)] 

111  {l1  ~  +  4V!)  ^  ~~  4p^  _  +  4p!^  -  p ^ 

-  (1  -  j^)  [1  -  4p(l  -  p)]}‘,! 

=  0  _  m) [1  ~ 2p(1  _  p)] 

±  ((l  -  j^)  4p2(l  -  P)2  +  [1  -  4p(l  -  p)  +  4p2(l  -  p)2])  . 

P*  and  S‘  are,  therefore,  of  the  form 

(5.11)  flXi  +  6X2. 
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A  somewhat  simple  answer  is  obtained  if  one  assumes  that  p  equals  c/N  and 
is  small,  so  that  a  small  number  of  genes  migrate  from  one  niche  to  the  other 
to  produce  the  next  generation.  In  this  case  the  larger  root  is 

<5-i2> x'  -  o  -  h)  [>  - 1 + °  (355)] + [% + ^ + °  (wr 

which  is  approximately  equal  to 

(5.i3)  1  -  m [1  +  4c  _  a  +  1Cc!)1,2]- 

This  result  is  related  to  the  results  of  Moran  [6].  Other  results  in  connection 
with  population  subdivision  have  been  obtained  with  this  type  of  argument  by 
my  colleague  E.  Poliak. 

6.  The  consideration  of  triples,  quadruples,  and  so  on,  of  genes 

Just  as  one  can  consider  the  probability  that  a  pair  of  genes  are  identical  by 
descent,  one  can  consider  the  status  of  a  triple  of  genes,  or  a  set  of  four  genes 
and  so  on.  Harris  [2],  in  his  generalization  of  the  work  on  covariance  of  two  rel¬ 
atives,  X  and  Y  with  genes  say  xa,  xd,  for  X,  and  ye,  ya  for  Y,  the  subscripts  s  and 
d  denoting  genes  received  from  sire  and  dam,  respectively,  found  it  necessary  to 
consider  all  the  possible  configurations  with  regard  to  identity  by  descent  of  the 
four  genes.  The  possibilities  are  as  follows  in  which  vertical  bars  separate  groups 
of  genes  identical  by  descent 

x,  xd  y*  ya  x ,  xd\ys\  yd 

xa  ya  |  xd  |  yd 
x  s  xd  y<  |  yd  xs  yd\xd  \  ya 

x 4  xd  yd  [  y,  xd  y,\x,\  yd 

xs  ys  yd  |  Xd  xd  y*\x,\  ys 

xd  y>  yd !  xs  y,  yd  \xt  \  xd 

xs  xd  |  y,  yd  Xs\xd\yg\  yd . 

x,  ys  |  Xd  yd 

Xs  yd  |  xd  ys 

So,  taking  account  of  origin  of  the  genes  in  an  individual,  there  are  15  different 
possibilities,  and  a  complete  accounting  of  the  probability  status  of  two  indi¬ 
viduals  would  require  the  determination  of  the  probabilities  of  the  15  different 
possibilities. 

The  calculus  of  probabilities  of  two  genes  being  identical  by  descent  is  very 
easy,  as  I  have  already  mentioned.  The  general  calculus  of  probabilities  with 
regard  to  four  genes  will  not  be  easy.  A  beginning  was  made  by  Harris  [2], 
and  further  work  is  in  progress. 

I  believe  it  is  interesting  to  relate  the  above  to  Fisher’s  working  out  of  the 
progress  under  full-sib  inbreeding.  If  for  ease  of  typography  and  reading  we 
denote  the  two  genes  of  the  male  by  ab  and  of  the  female  by  cd,  and  if  we  use 
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the  same  letter  out  of  a,  b,  c,  d  to  denote  genes  identical  by  descent,  the  possible 
configurations  are 


aa 

X 

aa 

(1) 

aa 

X 

ab 

(4) 

aa 

X 

ba 

(4) 

ab 

X 

aa 

(4) 

ba 

X 

aa 

(4) 

aa 

X 

bb 

(3) 

ab 

X 

ab 

(2) 

ab 

X 

ba 

(2) 

aa 

X 

be 

(5) 

ab 

X 

ac 

(6) 

ab 

X 

ca 

(6) 

ba 

X 

ac 

(6) 

ab 

X 

cb 

(6) 

be 

X 

aa 

(5) 

ab 

X 

cd 

(7). 

As  regards  configurations  for  an  autosomal  locus,  the  15  different  possibilities 
reduce  to  the  7  mating  types  for  which  Fisher  did  an  extensive  analysis.  So, 
as  is  after  all  obvious,  Fisher’s  work  based  on  the  study  of  likeness  of  four 
genes- is  in  the  same  spirit  as  Wright’s  study  of  the  likeness  of  two  genes,  and 
it  is  clearly  foolish  to  use  one  piece  of  work  to  belittle  the  other. 

7.  The  progress  of  a  finite  monoecious  population 

Let  the  individuals  in  generation  t  be  denoted  by 
(7.1)  A\\A\2,  A21A22,  •  •  •  , 

Then  with  random  mating,  including  selfing  with  the  appropriate  frequency, 
every  gene  in  generation  t  -f  1  is  an  independent  member  from  the  probability 
array 

(7-2)  .4-'  =  ^  £  A‘u  =  L  Z  A'„,  M  =  2 N. 

Let 

P2 ,t  =  Ft  denote  P{A\}  =  A\>j>},  which  will  be  the  same  for  all  pairs 

(ij)  5*  (i'f), 

Pz,t  denote  P{A\j  =  Avy  =  which  will  be  the  same  for  all  triples 

which  are  unequal, 

Pi,t  denote  P{A\j  =  Alvy  =  A\»r  =  Alt-r»} ,  the  same  for  all  unequal  quad¬ 
ruples. 

It  is,  of  course,  well  known  that 


1 


1 
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For  triples  of  genes,  there  are  M3  possible  choices;  in  M  of  the  choices  the 
same  gene  will  occur;  in  SM(M  —  1)  cases,  one  gene  will  occur  twice  and  the 
other  once;  and  in  M(M  —  1  ){M  —  2)  cases,  three  different  genes  of  the  pre¬ 
vious  generation  will  occur.  So 


(7.4) 
or 

(7.5)  P3.h-i 


Pz.t+i  =  Jp  \M  +  3 M(M  -  1  )P2(t  +  M(M  -  1  )(M  -  2)Pz.t] 


1  (M  — 1)  {M  -  IXM  -  2) 

M2  +  6  M2  +  M2  1 i't’ 


In  an  obvious  way, 

r  m  P  1  ,  7  (M  -  1)  (M  -  1  )(M  -  2) 

u .bj  4,t+l  ~  +  '  — Jp —  P*.t  +  b - ^ - I'z.t 

,  (^  “  1)(M  -  2)(M  -  3)  „ 

1 - - r4,f 


If  now,  we  let  P*  =  1 
(7.7) 


M3 

P,  we  have  the  simple  recurrence  relations 


PI 

Pt 

Pt 


t+i 


I  - 

3 

7 


M 

(M  -  1) 
M 2 

(M  -  1) 
M3 


6 


(M  -  1  )(M  -  2) 

M2 

(M  -  1  )(M  -  2)  (M  —  l)(M  -  2)(M  -  3) 


M3 


M3 


Pt 

Pi 

pi 


where  the  subscript  represent  the  generation.  The  roots  are  obviously 

<7-8)  1  -  h  (‘  -»)('- 1)  0  -  ii)  0  ~  h)  ('  - 1) 


or  if  we  let 


(7.9) 


t  =  1,  2,  •  •  •  ,  M  -  1, 


the  roots  are  81,  8182,  818283.  The  matrix  of  the  recurrence  equations  then  becomes 


(7.10) 


G  = 


'5i 

35i 

M 

7 8y 
M2 


0 

8182 

it 


0  " 
0 

818283 


which  has  a  particularly  simple  form. 

The  relation  between  P2,  P3,  Pt  in  successive  generations  is  then  given  suc¬ 
cinctly  by 
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/Pt\  /Pt\ 

(7.11)  *1+1  =  I  PS) 

Wm  W« 

To  determine  the  vector  6  for  any  generation  we  find  a  matrix  C  such  that 


(7.12)  CGC-1  =  diag  (8h  8iS», 

With  some  elementary  manipulation,  it  turns  out  that 


(7.13) 

and 

(7.14) 

It  follows  that 

(7.15) 

and  in  fact, 

(7.16) 


C~l  = 


4/ (1  -  52) 
7  +  11«2 


|_4/2(1  -  62) (1  -  52<53)  4/(1  -  83) 


c  = 


1 

-3 


M(  1  -  52) 
11  +  783 


0 

1 

-6 


[_M\l  -  S3)(l  -  8283)  4/(1  -  83) 


Gl  =  C~l 


‘8\  0 

0  («A)‘ 


0 


0 


0 
0 

(818283)  'J 


c, 


G<  =  8\ 


M(  1  -  S2) 
7  +  1152 


LM2(1  -  52)(1  -  5253) 
0 

-3 


0  O' 

0  0 

0  0 


+  («A)‘ 


4/ (1  -  82) 
-18 


LM2(1  -  52)(1  -  82)  4/(1  -  53) 


O' 

0 

0 


+  (5i5253)j 


0 

0 

11  +  7  83 

_4/2(  1  -«*)(!  -  8282) 


0  O' 

0  0 


4/(1  —  53) 
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Si  0  0  “ 

3Si-3SiSi 

M(  1  -  S2)  0 

(7  +  lis2)si 

(7.17)  G*  =  M*(  1  -  S2)(l  -  S2S3) 

—  185iS2  CSiSi  -  6SiSiSi 

M\  1  -  S2)(l  -  S3)  3/(1  -  S3)  123 

+  (11  +  7S3)SiSjSi 
JP{  1  -  S3)(l  -  S2S3) 

If  then  we  measure  the  progress  of  likeness  by  descent  of  genes,  we  take  the 
base  population  to  consist  of  different  genes  and  the  initial  values  of  P2,  P3,  P\ 
to  be  unity.  Therefore, 

Ph  =  si, 


n‘  -  71/(1-  h)  +  [!  “  7i/(i-  a,)]  s'si 


(7.18) 


P*  =  7  +  11S2  , 

4,<  1  -  S2)(l  -  S2S3)  1 


6  18 
M{  1  -  S3)  il/2(l  -  S2)(l 

11  +  7  S3  “1 


1  r  1 _ 6  1 _ 11  +  7s3  ~i  t , , 

+  L  M(l  -  «,)  +  MH\  -  Ss)(l  -  «A)J 

~  \  5  -  WM  )  Sl  ~  M 2  +  V5  -  6/iW )  5'SA- 

A  pair  of  genes  can  have  one  of  the  two  states:  identical  by  descent  with 
probability  P2  (called  F  earlier  in  this  paper);  and  nonidentical  by  descent 
Pm  ( P  earlier).  For  a  triple  of  genes  the  possibilities  are  all  three  alike  with 
probability  P3,  two  alike  and  one  different  P2/i,  all  different  Pi/1/1.  It  is  clear  that 


(7.19) 


P2  —  P3  +  Q  P  2/1; 


SO 

(7.20)  P2/1  =  3 (P2  -  P3)  =  3(P3  -  P2), 

and 


(7.21)  P 1/1/1  —  1  —  P3  —  P2/j 

=  1  +  2P3  -  3  P2 
=  3P2  -  2 PJ. 

In  the  case  of  a  quadruple  of  genes,  things  are  not  as  simple,  and  the  probabili¬ 
ties  we  need  for  complete  description  are 
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(7.22)  Pi,  P3/1,  P 2/2,  P2/111,  and  Pi/1/1/1. 

It  is  clear  that 

(7.23)  P3  =  P 4  ^  P 2/1, 

so  we  know  Pm  for  any  generation.  The  others  are  obtainable  only  with  supple¬ 
mentary  computations.  We  find  that 

P‘2/2  =  Pl/1  +  P2/I  +  61 8283  P2/2, 

(7.24) 

P2/1/1  =  -  m  2  P\/\/i  +  P 2/1/1. 

Hence, 

PI  +  ~  (PS  -  PI)  +  MA  PI/2, 

(7.25) 

P„„.  =  ^  (3P|  -  2 P|)  +  5,5283  PI/1/1. 


It  follows  that  the  complete  specification  of  the  status  of  two,  three,  or  four 
genes  is  given  by  the  equation 

(7.26) 


We  note  that  consideration  of  the  other  possible  configurations  of  four  genes 
causes  repetition  of  the  root  616253.  A  little  algebra  will  yield  the  tih  iterate  of 
the  coefficient  matrix. 

The  generalization  of  the  above  to  the  case  of  fcplets  of  genes  is  direct,  though 
there  are  complexities  of  algebra.  Essentially,  what  is  involved  is  the  making 
of  k  independent  draws  from  an  array  of  M  objects.  There  are  Mk  possible 

results,  and  if  we  denote  by  j^fj,  the  number  of  unordered  partitions  of  k  into  r 

parts,  the  number  of  results  in  which  the  k  draws  arise  from  r  different  objects  is 

(7.27)  M(M  -  1)  •  •  •  (M  -  r  +  1). 
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It  follows  that  the  recurrence  equation  for  Pt,  Pt,  •  •  •  ,  Pt  is 

(7.28) 


fa 


fa 

M 

A 

M2 


0  0 


8182  0 

4“J  8182  t 

.3J  M  SlWs 


~k~ 

fa 

~k~ 

8182 

~k~ 

_2_ 

Mk~ 2 

.3. 

Mk~ 3 

_4_ 

818283 

Mk=l 


or 


(7.29)  <£<+ 1  — 

It  appears  that  the  matrix  C  such  that 

(7.30)  CHC~l  =  diagonal 

has  a  nice  structure  in  terms  of  5i,  fa,  •  •  •  . 

Some  aspects  of  the  progress  of  a  finite  dioecious  population  are  being  devel¬ 
oped  by  the  same  approach,  but  I  do  not  yet  have  the  results. 


8.  fcplets  of  genes  in  a  finite  haploid  population  with  random  viabilities 

The  arguments  of  previous  sections  may  be  combined.  If  we  let  P3t  be  the 
probability  of  three  genes  being  identical  by  descent,  we  have 


(8.1) 

Pst+I  =  Zpl  +  3  E  piPi'  Pu 

+  PiPi’Pi"  Pit. 

tvV 

tV’iVt" 

Let 

Zp!  = 

i 

Nn  fz, 

(8.2) 

II 

AV21, 

i*i' 

L  PiPi'Pi"  = 

N*n[  11, 

and 

(8.3) 

11 

E 

1  -P3. 

Then, 

(8.4) 

■Plu+D  — 

3AV21  Pit  +  N*n fn  Pt, 

Also, 

(8.5) 

Pfat+D  = 

(1  -  Nul)  Pit. 

IDENTITY  OF  GENES  BY  DESCENT 


347 


It  follows  that 


(8.6) 

if 


Ptu+i)  —  yPZ(t+\)  —  3N2n2iP2t  +  N3nmPtt  —  y[l  —  Ny/HPlt 
=  hi  [Pat  —  7P2/] 


(8.7) 
or  if 
(8-8) 


NVniy  =  7(1  —  Npz)  —  3A/"V2i 


3  NV21 

1  —  Nn  2  —  N3n'ui 


Hence,  P3  —  7P2  decreases  in  the  ratio  N3n'n  1  in  each  generation. 

The  extension  to  /cplets  of  genes  is  clear,  though  the  algebra  becomes  very 
tedious.  In  general,  there  is  a  form 


(8.9)  Pi  —  yk,k-i  Pi-i  —  yk,k-2  Pi-2  —  •  •  •  —  7^,2  Pi 

of  the  probabilities  which  decreases  in  each  generation  on  the  ratio  Nkn'u...  1, 
where 


(8.10)  Nkn h-.-i  =  E  PuPii  •  •  *  P»- 

tlfiS,  •  *  *  ylk 

unequal 

If  all  pi  equal  l/N,  the  ratio  is  N&)/Nk  as  we  already  knew. 

It  may  be  noted  in  passing  that  the  incorporation  of  mutation,  at  least  in 
one  form,  presents  few  difficulties. 


9.  Concluding  remarks 

The  determination  of  the  consequences  of  Mendelism  with  random  mating 
whether  in  a  haploid  population,  a  monoecious  or  dioecious  population  is  a 
matter  of  algebraic  computation.  By  so  characterizing  the  work,  I  do  not  wish 
to  be  derogatory.  The  computations  can  become  very  excessive  and  very 
tedious,  particularly  if  one  wishes  to  develop  the  probability  status  of  the 
whole  population.  In  one  way  or  another,  this  involves  the  determination  of 
the  roots  and  both  left  and  right  eigenvectors  of  the  particular  probability 
transition  matrix  that  is  a  consequence  of  the  whole  system.  This  proves  to  be 
very  difficult,  though  Drs.  Karlin  and  McGregor  have  made  notable  progress. 
The  purpose  of  my  own  presentation  is  to  show  that  if  one  is  satisfied  with  a 
partial  picture  of  the  progress  of  the  population,  one  may  well  be  able  to  do  so 
by  using  the  concept  of  identity  of  genes  by  descent. 

From  a  genetic  viewpoint,  I  am  inclined  to  think  that  a  working  out  of  limit¬ 
ing  distributions  using  only  the  largest  root,  which  is  frequently  of  the  form 
(1  —  l/N )  are  of  interest  only  mathematically.  The  common  case  is  that  the 
roots  are  very  close  as  (1  —  l/N),  (1  —  l/iV)(l  —  2/N),  (1  —  1/A0(1  ~  2 /N) 
(1  —  3 /N),  and  so  on.  It  would  seem  that  the  length  of  time  required  for  the 
situation  to  be  describable  reasonably  in  terms  of  the  largest  root,  would  be 
such  that  the  model  can  no  longer  be  trusted  unless  mutation  is  included.  If 
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the  same  mutation  can  occur  more  than  once,  I  suspect  that  a  solution  based 
on  only  the  largest  root  can  be  quite  misleading  at  least  for  populations  of  the 
size  that  occur  in  Nature.  This  situation  with  regard  to  the  regular  inbreeding 
systems  commonly  used  would  appear  to  be  somewhat  different.  It  is  evident 
that  many  difficult  problems  in  the  dynamics  of  Mendelian  populations  remain. 

It  has  been  brought  to  my  attention  that  some  work  on  similar  lines  was 
done  contemporaneously  be  Michel  Gillois  in  a  thesis  entitled,  “La  relation 
d’identit^  en  g6n6tique,”  Faculty  of  Sciences  of  the  University  of  Paris  [8]. 
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